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TITLE: Soft, Prioritised Early Packet Discard System 
FIELD OF THE INVENTION 

This invention generally relates to systems of discarding cells during congestion in an 
5 Asynchronous Transfer Mode (ATM) network, and is particularly concerned with a system 
based on Soft, Prioritised Early Packet Discard (SPEPD) method. 

DESCRIPTION OF RELATED ART 

As a result of the bursty nature of data traffic, and the fact that they have high propagation- 
's delay products, future terrestrial and satellite Asynchronous Transfer Mode (ATM) networks 
]?10 will not be immune to congestion situations. As such, it is crucial that an effective 
2* congestion control method be implemented in such networks. 

Depending upon channel conditions, congestion duration can vary from cell-level times, i.e. 
- the length of time required to transmit a cell, to call-level times, i.e. the time it takes to 

hj transmit a call. Specific to cell-level congestion, a combination of a queuing discipline and a 
•^15 cell discard scheme is commonly utilized to minimize the intensity and duration of 
y3 congestion. 

A wealth of cell-discard methods is known in the art. Some of the relevant discard methods 
are described in the prior art references listed further below. The simplest of such methods is 
the Discard Tail (DT) method, as described in reference [1], where the switch simply discards 
20 cells arriving to a full buffer. A major weakness of the DT method is that receivers tend to 
request retransmission of incomplete packets, causing the retransmission of all the cells 
belonging to the packets that have some of their contents in the dropped cells. The request 
for retransmission occurs while the channel is still congested, introducing even more traffic 
into an already congested channel. 

25 A Partial Packet Discard (PPD) method is described in reference [2], which is designed to 

alleviate the negative effect of re-transmission. In the PPD method, when a cell is discarded, 
subsequent cells of the same packet are also discarded, as a result, more buffer space is made 
available for whole packets. However, this does nothing to the rest of the cells in a packet if 
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they are already in the queue's buffer memory. In a worst-case scenario, the last cell in a 
packet is the one that is dropped, meaning that an almost complete packet is transmitted 
immediately before a request is made for its retransmission. 

An Early Packet Discard (EPD) method is described in reference [3], which enhances the 
5 PPD method by discarding whole packets, instead of partial ones, so that buffer space is used 
only by whole packets. In the EPD method, a fixed threshold on the queue size is set, above 
which the switch starts to discard complete packets. Although in the majority of cases the 
EPD method performs better than the PPD method, its performance depends on proper 
^ selection of a queue threshold level. The queue threshold should be selected to avoid wasting 
CjlO buffer space, which happens if the threshold is set too low, or the partial discarding of a 
Zj. packet, which happens if the threshold is set too high, assuming the PPD method is executed 
H when the buffer is full. The expected number of active connections and the average packet 
K? size typically determine this threshold. Unfortunately, both these numbers are difficult to 

estimate in practice. The use of a fixed threshold has another disadvantage when using 
!r:15 Transmission Control Protocol (TCP). TCP detects congestion only after a packet has been 
ry dropped. Due to the fixed threshold used in EPD, the switch tends to discard packets 
% successively, resulting in many TCP connections reducing their window size simultaneously, 
i.e. global synchronization. Since the speed of TCP congestion recovery is heavily dependent 
on the round-trip delay, the effect of global synchronization on throughput is more significant 
20 in satellite networks and in high-speed, long-delay terrestrial networks. 

A Random Early Discard (RED) method is described in both references [4] and [5]. The 
RED method is designed for networks, whose underlying transport protocol is TCP-like, i.e. 
where a dropped packet is sufficient to indicate congestion to the source. Due to the growth 
of the Internet, an increased amount of traffic carried by current and future ATM networks 

25 will be Internet Protocol (IP) traffic, and by extension TCP traffic as well. Therefore, the 
RED method has been considered for implementation in ATM switches, as described in 
reference [6]. In the RED method, the switch randomly chooses packets to discard when the 
average queue size is between a minimum and a maximum queue thresholds. With RED, the 
number of packets discarded is proportional to the average queue size. The RED method 

30 successfully avoids global synchronization problems and maintains a low average queue size. 
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A major weakness of the RED method is that its algorithm is computation-intensive. 
Furthermore, anticipating that the distribution of traffic is random, it is often unnecessary to 
perform computationally expensive calculations of drop probability to further randomize the 
discarding of packets. 

5 All of the above mentioned methods share a common problem of unfairness. Specifically, 
assuming that both non-TCP-like and TCP-like connections share the same service class and 
that per-class queuing is used in the switch, the throughput of non-TCP-like connections is 
expected to be greater than that of TCP-like connections. 

An EPD method with Selective Discard (SD) or Fair Buffer Allocation (FBA), is described in 
10 reference [7]. While this approach may successfully avoid the problem of unfairness, it does 
so at the cost of an increase in both memory (and memory bandwidth) requirements and 
computational complexity due to the use of per- Virtual-Circuit (VC) accounting. A Flow 
RED method, which is described in reference [8], uses a similar approach to provide equal 
treatment to non-TCP-like and TCP-like connections, but also imposes a computational 
15 penalty in doing so. 

There have been attempts to solve some of the existing problems by implementing a variable 
threshold to the standard discard methods. One such example is disclosed in US Patent 
5901 147, which adjusts the discard threshold as a function of the number of queues in use, 
and the occupancy of those queues. This method however still uses the standard EPD 
20 techniques to discard complete packets or frames whenever the threshold value is exceeded. 
This still results in the global TCP synchronization problem resulting in lowered TCP 
throughput mentioned earlier. 

A second example of a method utilizing a variable threshold is that disclosed in US Patent 
5936939, which uses statistics such as the arrival and departure rate to set the threshold level. 
25 This method also uses the typical EPD methodology, which causes the aforementioned TCP 
related problems. 
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A third example of a variable threshold method is disclosed in US Patent 5426640, which 
provides an end-to-end congestion control scheme. In this scheme, packet discards are 
spread over many nodes in a given path so as to reduce overall congestion. Though this does 
not directly address TCP concerns it may in fact resolve some of these issues, but the link 
5 between the method and objective has not yet been established. A drawback of this system is 
that it is an end-to-end method that requires control and management of every node in the 
system. As a result it cannot be implemented at a given node on the network without 
controlling the rest of the nodes. 

There is therefore a need for a fast, simple and fair packet discard method, which is more 
10 effective for TCP-like connections and for satellite and very-high-speed terrestrial 
communication environment with mixed types of traffic. 

Sources of relevant prior art information include the following: 

[1] Hashem E., "Analysis of Random Drop for Gateway Congestion Control", Report LCS 
TR-465, Laboratory for Computer Science, MIT, Cambridge, MA, 1989; 
1 5 [2] G. J. Armitage, "Packet Reassembly During Cell Loss", IEEE Network, Vol. 7, No. 9, 
September 1993; 

[3] Allyn Romanov and Sally Floyd, "Dynamics of TCP Traffic over ATM Network", 
IEEE JSAC, Vol. 13 No. 4, May 1995, pg. 633-641; 

[4] Sally Floyd and Van Jacobson, "Random Early Detection Gateways for Congestion 
20 Avoidance", IEEE/ACM Transactions on Networking. August 1993; 

[5] B. Braden et. Al, "Recommendations on Queue Management and Congestion 
Avoidance in the Internet", RFC 2309, April 1998; 

[6] Omar Elloumi and Hossam Afifi, "RED Algorithm in ATM Networks" Technical 
Report, June 1997; 

25 [7] Rohit Goyal, "Traffic Management for TCP/IP over Asynchronous Transfer Mode 

(ATM) Networks", Ph.D. Dissertation, Department of Computer and Information Science, 
The Ohio State University, 1999; 

[8] Dong Lin and Robert Morris, "Dynamics of Random Early Detection", Proceedings of 
the ACM SIGCOMM '97, September 1997; and 
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[9] Jun Huang, "Soft-Bounded Policy for Congestion Control of Communication Network," 
IEEE PRC on CCSP, Vancouver, Canada, 1991 . 

For reader's convenience, a glossary of acronyms used in this description is given below. 





ABR 


Available Bit Rate 




ATM 


Asynchronous Transfer Mode 


CLR 
EOP 


Cell Loss Ratio 
End of Packet 




EPD 


* Early Packet Discard 




FBA 


Fair Buffer Allocation 


FTP 


File Transfer Protocol 




GEO 


Geo-stationary 




GFR 


Guaranteed Frame Rate 




IP 


Internet Protocol 


m 


MSB 


Most Significant Bit 


ry 


PPD 


Partial Packet Discard 




QoS 


Quality of Service 




RED 


Random Early Discard 




RFC 


Request For Comment 




SD 


Selective Discard 




SPEPD 


Soft, Prioritised Early Packet Discard 




TCP 


Transmission Control Protocol 




UBR 


Unspecified Bit Rate 




VBR 


Variable Bit Rate 




VPN 


Virtual Private Network 




WUBR 


Weighted Unspecified Bit Rate 



SUMMARY OF THE INVENTION 

In packet based and other connectionless networks, such as computer networks, conventional 
congestion control methods are based on selectively discarding packets or equivalently 
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adjusting buffer size according to the level of congestion. Under the normal loading 
circumstances of transmission systems no packets are discarded, but under heavy traffic 
conditions packet loss is common, and the levels of packet loss depend upon the bandwidth 
required by the traffic. Similarly for connection-oriented networks, congestion control 
5 methods are partially based on denying new calls as a result of the high level of congestion in 
the system. 

Congestion in most networks is a dynamic phenomenon, not a static resource issue. Packet 
loss due to buffer shortage and packet delay due to buffer sufficiency are symptoms, not 
causes of congestion. The causes of congestion can be found in the stochastic properties of 
10 the traffic. If the system traffic can be deterministically modelled, congestion problems 
would not occur. 

Therefore, it is an object of this invention to provide a packet discard method that will 
facilitate the creation of the dynamic resources needed to remedy the problem associated with 
prior art solutions. Another object of this invention is to reduce the fluctuation of the demand 

15 for the aforementioned resources, instead of simply attempting to create a resource or 

reducing demands. It is yet another object of this invention to provide a better efficiency than 
the conventional EPD method. A further object of this invention is to solve the throughput- 
reducing global synchronization problem associated with TCP. In addition, it is an object of 
this invention to introduce a priority-based method to better match the Quality of Service 

20 (QoS) requirements associated with a service class. Finally, it is an object of this invention to 
provide a Soft Prioritised Early Packet Discard (SPEPD) method, which combines the 
simplicity of EPD and the effectiveness of RED for TCP-like connections. 

In accordance with an aspect of the present invention there is provided a system for 
controlling traffic congestion within a buffered data switching network having a 
25 predetermined total buffer size, said system comprising: 

(a) a packet counter for counting the number of newly arriving packets; and 

(b) threshold means for setting a packet-count threshold; 

wherein when the number of newly arriving packets reaches the packet-count threshold and 
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when the average queue size exceeds the congestion threshold, a packet is discarded and the 
packet counter is reset to a zero count. 

In an embodiment of the present invention, the system further comprises calculation means 
for calculating an average queue size to be used by the threshold means in setting the packet- 
5 count threshold. Preferably, the calculation means regularly updates the average queue size 
using an exponential averaging technique, wherein the average queue size at time t is 
calculated as: 

Qt = Qt-\ x ( l - Alpha)+ Q t x Alpha , 

where Q x is an instantaneous queue size and Q t -i is the average queue size at time t-1, and 
10 Alpha is a queue-length averaging parameter assigned a value between zero and one. A 
progressively increasing value of Alpha is assigned with increasing level of traffic 
congestion, as indicated by the instantaneous queue size. 

Conveniently, the average queue size is updated after a predetermined number of cells have 
arrived since a previous packet discard. Alternatively, the average queue size is updated after 
15 a predetermined period of time has elapsed since a previous packet discard. 

A preferred embodiment further comprises means for dividing the total queue size into a pre- 
selected number of N regions, wherein the threshold means sets the packet-count threshold by 
using a descending staircase function F(n), such that one of every F(n) packets is discarded, 
when the average queue size is in a buffer region n, 1 < n < N. This preferred embodiment 

20 further comprises means for detecting traffic congestion by setting a congestion threshold and 
comparing the average queue size with the congestion threshold, such that a congestion 
condition is indicated by the average queue size being equal to or above the congestion 
threshold, and an absence of congestion is indicated otherwise. The packet is discarded only 
during the congestion condition. The packet counter begins to operate when traffic 

25 congestion is detected, and halts operation when an absence of traffic congestion is detected. 

An alternative embodiment further comprises means for dividing the total queue size into a 
pre-selected number of M regions, for high-priority traffic defining a high-priority congestion 
threshold, and a pre-selected number of N regions for low-priority traffic defining a low- 
priority congestion threshold, wherein the threshold means sets the packet-count threshold by 
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using two functions F(n,m) and F(m), such that: 

when the average queue size of high-priority traffic is above the high-priority 
congestion threshold and is in the buffer region m, 1 < m < M 5 one of every F(m) high 
priority packets is discarded; and 

when the average queue size of low-priority traffic is above the low-priority congestion 
threshold and is in the buffer region n, 1 < n < N, one of every F(n,m) low priority packets is 
discarded. The function F(m) is a descending staircase function in the buffer region m, and 
the function F(n,m), is a multivariable function of m and n, which has a descending staircase 
behaviour in the buffer region n for a fixed value of m. 

Optionally, the system further comprises means for applying a priority scheme for discarding 
packets, which provides a differentiated service among service classes sharing a common 
buffer. 

Also optionally, the threshold means uses a look-up table, and sets the packet-count threshold 
upon arrival of a new packet into the system. Alternatively, the threshold means sets the 
packet-count threshold upon departure of a packet from the system. 

In accordance with another aspect of the present invention, there is provided a method for 
controlling traffic congestion within a buffered data switching network having a 
predetermined total buffer size, said method comprising the steps of: 

(a) counting the number of newly arriving packets; 

(b) setting a packet-count threshold; and 

(c) discarding a packet and resetting the packet-counter, when the number of newly arriving 
packets reaches the packet-count threshold and the average queue size exceeds the congestion 
threshold. 

In another embodiment of the present invention, the method further comprises a step of 
calculating an average queue size to be used by the step of setting the packet-count threshold. 
Preferably, the calculating step regularly updates the average queue size using an exponential 
averaging technique, wherein the average queue size at time t is calculated as: 

Qt =Qt-\ Alpha) +Q t x Alpha, 

where Q t is an instantaneous queue size and Q t -i is the average queue size at time t-1, and 



8 

9 



Alpha is a queue-length averaging parameter assigned a value between zero and one. A 
progressively increasing value of Alpha is assigned with increasing level of traffic 
congestion, as indicated by the instantaneous queue size. 



Conveniently, the average queue size is updated after a predetermined number of cells have 
arrived since a previous packet discard. Alternatively, the average queue size is updated after 
a predetermined period of time has elapsed since a previous packet discard. 

Another preferred embodiment further comprises a step of dividing the total buffer size into a 
pre-selected number of N regions, wherein the setting step sets the packet-count threshold by 
using a descending staircase function F(n), such that one of every F(n) packets is discarded, 
when the average queue size is in a buffer region n, 1 < n < N. This preferred embodiment 
further comprises a step of detecting traffic congestion by setting a congestion threshold and 
comparing the average queue size with the congestion threshold, such that a congestion 
condition is indicated by the average queue size being above the congestion threshold, and an 
absence of congestion is indicated otherwise. The packet is discarded only during the 
congestion condition. The packet counter begins to operate when traffic congestion is 
detected, and halts operation when an absence of traffic congestion is detected. 

Another alternative embodiment further comprises a step of dividing the total buffer size into 
a pre-selected number of M regions, for high-priority traffic defining a high-priority 
congestion threshold, and a pre-selected number of N regions for low-priority traffic defining 
a low-priority congestion threshold, wherein the setting step sets the packet-count threshold 
by using two functions F(n,m) and F(m), such that: 

when the average queue size of high-priority traffic is above the high-priority 
congestion threshold and is in the buffer region m, 1 < m < M, one of every F(m) high 
priority packets is discarded; and 

when the average queue size of low-priority traffic is above the low -priority congestion 
threshold and is in the buffer region n, 1 < n < N, one of every F(n,m) low priority packets is 
discarded. Here, the function F(m) is a descending staircase function in the buffer region m, 
and the function F(n,m), is a multivariable function of m and n, which has a descending 
staircase behaviour in the buffer region n for a fixed value of m. 
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Optionally, the method further comprises a step of applying a priority scheme for discarding 
packets, which provides a differentiated service among service classes sharing a common 
buffer. 

The SPEPD method is a congestion control method suitable for satellite onboard switch and 
5 very-high-speed terrestrial switch implementation, but its principle can be applied in various 
switches, terminals and gateways. The method is especially designed for the access cards, 
i.e. input or output interface cards, of the satellite onboard switch where design simplicity is a 
critical requirement. SPEPD is intended for Guaranteed Frame Rate (GFR), Available Bit 
Rate (ABR), Unspecified Bit Rate (UBR) and the forthcoming Weighted UBR (WUBR) 
10 service classes. 

The SPEPD method is capable of reducing the amount of buffer space required on a switch, 
hence the cost of a switch, with its active buffer management while providing better 
performance than the conventional EPD method. Furthermore, its RED-like method brings 
about improved TCP throughput. In addition, the SPEPD method provides a general-purpose 
1 5 solution, which is effective for mixed-traffic environment due to its use of priority and its 
non-TCP-specific method. 

The SPEPD method offers four main advantages over conventional methods: 

a) fast-response active queue management system; 

b) simple hardware implementation; 

20 c) packet-level prioritised discard scheme; and 
d) flexibility. 

Unlike the RED method, the SPEPD method uses a progressively higher exponential queue- 
length averaging parameter for higher instantaneous queue length, a strategy that results in a 
faster reaction to congestion situations, which is critical for a high-speed switch or a switch 
25 with a very limited buffer memory. Furthermore, the SPEPD method uses a more regular 
means of updating the average queue size. This strategy provides several advantages to the 
RED strategy of updating the average queue size at every packet arrival, i.e. at the arrival of 
the start- or end-of-packet cell, including: 
i) faster reaction to congestion situation; 

10 
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ii) more regular updates of average queue size to avoid sudden jumps or drops in the 
average value; and 

iii) avoiding a worst case situation where updates have to be performed rapidly due to 
successive arrivals of start- or end-of-packet cells. 

5 Unlike the random drop strategy of RED, the SPEPD method uses a strategy of dropping 
packets regularly which allows a simplified hardware implementation. As a result, the 
complexities of both the calculation of the random drop probability as well as the calculation 
used to determine whether or not to perform packet discard are greatly reduced. The risk of 
service unfairness from dropping packets regularly can be ignored for the following reasons. 

1 0 The risk of service unfairness, that is, the risk of discarding successive packets from a given 
connection, will be negligible in the environment where a soft prioritised early packet discard 
(SPEPD) system will typically find use, i.e. high-speed core network elements in broadband 
networks. In such an environment, the risk of service unfairness is negligible because the 
average buffer occupancy level, the size of packets and the number of active connections tend 

15 to fluctuate rapidly. Furthermore, in addition to the three above factors, the high level of 

multiplexing in such network elements results in a high probability that the arrival of packets 
is sufficiently randomised to become pseudo-random in nature and thereby avoid service 
unfairness. 

The SPEPD method introduces priority to provide differentiated service among service 
20 classes sharing a common buffer pool or among service groups sharing a common service 

class buffer space which allows a prioritised packet level discard. This strategy also provides 
a base for efficient buffer management for future applications and services, such as Virtual 
Private Networks (VPN). The SPEPD method can be optionally enhanced with per- Virtual 
Channel (VC) accounting to provide further fair treatment to both TCP-like and non-TCP- 
25 like connections. 

In SPEPD, queue-length averaging parameters and packet-count thresholds are implemented 
using a lookup table or tables. These parameters can then be readily fine-tuned to suit 
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observed traffic characteristics and available buffer space, to allow greater flexibility in the 
implementation. 

The SPEPD method also improves upon the RED method by halting the counting of packets 
while the average queue size is below the congestion threshold. In contrast, the strategy with 
5 RED is to reset the packet counter to zero whenever the average queue size value is below the 
minimum threshold. Halting the counting of packets with SPEPD improves upon the count- 
reset scheme used by RED by preventing the system from having an overly slow response to 
congestion. The halting of the packet count with SPEPD also prevents an overly rapid 
response to congestion caused by a continuing count of packets. 

10 A queue-length averaging parameter in the form of a weighting factor Alpha is used with 

both RED and SPEPD in the calculation of the average queue size. Optimal queue size varies 
. with several factors, including the level of congestion in the system, which makes a fixed 
Alpha undesirable. The SPEPD method uses a variable Alpha to allow a performance closer 
to optimal levels than the fixed Alpha value of RED would allow. 

15 The strategy for triggering the update of average queue size is another differentiating 

characteristic of SPEPD. With RED, the average queue size is updated whenever a new 
packet arrives, which is equivalent to updating the average queue size whenever a start-of- 
packet cell arrives. Therefore, when start-of-packet cells arrive successively, the average 
queue size value will be updated a number of times in a very short interval, bringing about 

20 rapid fluctuations in the value of the average queue size, which would diminish the accuracy 
of the average value. The SPEPD scheme avoids this problem by conducting more regular 
updates of the average value. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Exemplary embodiments of the invention will now be further described with references to the 
25 drawings in which same reference numerals designate similar parts throughout the figures 
thereof, and wherein: 

Figure 1 graphically illustrates a single variable threshold determination algorithm in 
accordance with an embodiment of this invention, wherein the total buffer size is divided into 
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TV regions, and the frequency of packet discard operations is depicted as a descending 
staircase function F(«), 1 < n< N; 

Figure 2 graphically illustrates a multivariable threshold determination algorithm in 
accordance with another embodiment of this invention, wherein the total buffer size is 
5 divided into M regions for high-priority traffic, and into TV regions for low-priority traffic, and 
the frequency of packet discard operations for low-priority traffic is depicted as a function 
F(n t m) y 1 < n< N and 1 < m < M\ 

Figure 3 illustrates in a flowchart, a Single-Priority SPEPD embodiment of this invention; 
Figure 4 illustrates, in a flowchart, details of the step of calculating the average queue length 
1 0 shown in Figure 3 ; 

Figure 5 illustrates in a block diagram a hardware embodiment of the SPEPD process shown 
in Figures 3 and 4; 

Figure 6 illustrates, in a high-level network diagram, a network set-up used for SPEPD 
simulations; 

15 Figure 7 illustrates, in a table, the parameters used in the SPEPD simulation, showing the 
correlation between either the queue size or the average queue size and Alpha values with 
varying Packet threshold values; 

Figure 8 illustrates, in a graph, the Cell Loss Ratio for TCP Traffic measured at the ATM 
switch when a conventional EPD scheme is used; 
20 Figure 9 illustrates, in a graph, the Cell Loss Ratio for TCP Traffic measured at the ATM 
switch when an SPEPD scheme is used in accordance with this invention; 
Figure 10 illustrates, in a graph, the TCP Delay as measured at the Server when a 
conventional EPD scheme is used; and 

Figure 1 1 illustrates, in a graph, the TCP Delay measured at the Server when an SPEPD 
25 scheme is used in accordance with this invention. 



13 




DETAILED DESCRIPTION OF THE INVENTION 

This invention modifies the Prior art method of soft-bounded congestion control, which is 
described in reference [9], in order to create a simple dynamic threshold EPD, herein referred 
to as Soft Prioritised Early Packet Discard (SPEPD). 

5 A formal description of the soft-bounded congestion control is as follows. Let K* denote the 
state of the network without the dynamic control of station capacities, then 



K* - . y k L *) where k* is the number of packets in the i ih station, 1 <i<L. After 



the control of the buffer size is applied, it becomes clear that all states represented by K* are 
not feasible. The normalization of the infeasible states is essential. The packet distribution in 
10 each state is adapted to the capacity limit of each station in the blocking network by the 



Markovian property of the packet flow is preserved. 

The modifications that are done to create a dynamic threshold EPD method in accordance 
15 with this invention are now described. The total buffer size is divided into TV regions. When 
the average queue size is above a threshold that indicates congestion, henceforth referred to 
as congestion threshold for conciseness, and is in the buffer region w, 1 < n < N, one of every 
F(n) packets will be discarded, where F(n) is a descending staircase function of n defining the 
frequency of packet discard operations, also referred to herein as packet-count threshold, in 
20 accordance with an embodiment of this invention using a single variable packet-count 

threshold determination algorithm. An exemplary embodiment of the function ¥(n) is given 
in Figure 1 which graphically illustrates the single variable threshold determination 
algorithm. This strategy of discarding packets regularly provides for simple hardware 
implementation. 

25 As shown in Figure 1, F(n) has high values for low average queue sizes. This is designed to 
address the problem of global synchronization of TCP connections. By allowing adequate 
spacing between two packet discards, it is likely that only one TCP connection will be forced 
to reduce its window size at a given time. As a result of this spacing, the overall TCP 



function f\k* ) = (&), where (k) is the normalised state for the blocking network. The 
function /(•) maps the non-feasible state (k*) to the feasible state (k) . As a result, the 
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throughput will not be significantly affected in the vast majority of situations. On the other 
hand, the function F(n) has lower values for higher average queue sizes, which is aimed at 
making a certain amount of buffer space available to store packets during bursty periods in 
transmissions. As a result, partial packet discards, which can quickly trigger the global 
5 synchronization problem, are avoided. 

In order to perform packet discard at regular intervals, SPEPD counts the number of packets 
that have arrived in the system since the last packet discard event. When the number of new 
packets is equal to the current value of the function F(w), the newly arriving packet is 
discarded. The counting of packets is halted whenever the average queue size is below the 
10 congestion threshold. 

The averaging of the queue size is achieved using an exponential averaging scheme similar to 
the one employed by RED. The average queue size Q t at time t is calculated using the 
function 



15 where Q t is the instantaneous queue size and Q t .\ is the average queue size at time M, and 
Alpha is a weighting factor assigned a value between zero and one, referred to herein as the 
queue-length averaging parameter, that expresses the importance of the instantaneous queue 
size with respect to the average queue size. A high Alpha value (close to 1) means the 
instantaneous value is weighted more heavily in the calculation of the average queue size 

20 value, which thereby reduces the importance of past instantaneous values. A low Alpha value 
(close to 0) increases the importance of past instantaneous values at the expense of current 
values. Thus, the degree of fluctuation in the average queue size value is significantly 
affected by the choice of Alpha, 

25 The SPEPD method uses a progressively increasing value of Alpha as congestion worsens. 
This varying weighting factor Alpha is a key strategy in this approach and it addresses the 
shortcomings of the RED method with respect to the speed of response to congestion 
situation. The SPEPD method uses a low value of Alpha when the instantaneous queue size 
is low, in order to diminish the role of the instantaneous queue value in the calculation so that 



Q t = Qt-\ x (l _ Alpha) +Q t x Alpha , 
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the system is able to react faster to more severely congested future situations. Conversely, the 
SPEPD method uses a value of Alpha approaching 1 when the instantaneous queue size is 
high. The high value of Alpha increases the importance of the instantaneous queue size value 
in the calculation so that the system is able to respond quickly to current congestion 
5 situations. Furthermore, the decrease in response times translates to a reduction in the 
amount of buffer space required for the SPEPD method to work effectively, which makes 
SPEPD suitable for environments where hardware resources are very limited, such as for a 
near-terabit terrestrial switch or a satellite onboard switch. 

The SPEPD method conducts updates of the average queue size value at regular intervals to 
10 avoid overly rapid fluctuations in queue size. There are two alternatives for achieving this 

goal, one is to perform an update after a predetermined number of cells (or packets in packet- 
based switches) have arrived since the previous packet discard event. The second alternative 
is to perform an update after a predetermined amount of time has elapsed since the previous 
packet discard event. Either alternative falls within the scope and nature of the invention. 

15 Two flowcharts describing an SPEPD embodiment using the first alternative of update 

triggering are given in Figures 3 and 4. As shown in Figure 3, a first step 1 is to receive a 
current cell and read its connection ID. The average queue length is determined by a second 
step 2. The process within the second step 2 is described in more depth by the flowchart 
illustrated in Figure 4. A determination of whether or not the packet is eligible for packet 

20 discard is made by a third step 3. If the cell is not eligible, a regular cell discard method is 
executed by a fourth step 4, and the process ends by a termination step 5. If otherwise, the 
cell is eligible for packet discard then the state of the connection is examined by a sixth step 6 
to see if it is IDLE. If the state of the connection is not IDLE, the system checks in a seventh 
step 7 to see if it is EPD. If otherwise the connection's state is not EPD, a check is performed 

25 by an eighth step 8 to see if the current cell is the end of a packet (EOP). If the current cell is 
not EOP, it is discarded by a ninth step 9 and the process ends by the termination step 5. If 
otherwise it is EOP, the state of the connection is set as IDLE by a tenth step 10, and a 
decision is made by an eleventh step 11 not to discard the current cell, and the process ends 
by the termination step 5. 
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If the seventh step 7 indicates that the state of the system is EPD, then the current cell is 
checked by a twelfth step 12 to see if it is EOP. If it is not EOP, the current cell is discarded 
by a thirteenth step 13 and the process ends by the termination step 5. If otherwise the cell is 
EOP, the connection state is set to IDLE by a fourteenth step 14, the cell is discarded by the 
5 thirteenth step 13 and the process ends by the termination step 5. 

If the sixth step 6 indicates that the connection state was IDLE when a packet entered the 
system and the cell is eligible for packet discard, a fifteenth step 15 checks if the previous cell 
in the connection was EOP. If not, a sixteenth step 16 examines the current cell to see if it is 
EOP. If not, a seventeenth step 17 checks for buffer starvation. If the buffer is not starved, 
10 the cell is discarded by an eighteenth step 18 when the connection's state is EPD, and the 
process ends by the termination step 5. If otherwise the buffer is starved, then a nineteenth 
step 19 discards the current cell, resets the packet count, and sets the connection's state to 
PPD, after which the process ends by the termination step 5. 

If the sixteenth step 16 indicates that the current cell is EOP, then a twentieth step 20 discards 
15 the cell when the connection's state is EPD, a twenty-fourth step 24 sets the connection's 
state to IDLE and the process ends with the termination step 5. 

If the fifteenth step 15 indicates that the previous cell in the connection was EOP, then a 
twenty- first step 21 increments the packet count if the average queue size is equal to or higher 
than the congestion threshold, and then obtains a packet-count threshold based on the average 

20 queue size. A twenty-second step 22 then compares the packet count with the packet-count 
threshold, and compares the average queue size with the congestion threshold, which is 
derived from the function F(w) as shown in Figure 1 . If either the packet count or the average 
queue size is less than its respective threshold, then the sixteenth step 16 examines the current 
cell to see whether or not it is EOP. If both the packet count and the average queue size is 

25 equal to or higher than its respective threshold, then a twenty-third step 23 sets the 

connection's state to EPD and resets the packet count to zero, which is then followed by the 
sixteenth step 16. Based on the outcome of the sixteenth step 16, one of the three alternative 
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paths described above is followed; namely the twentieth step 20 followed by the twenty- 
fourth step 24, or the seventeenth step 17 followed by either the eighteenth step 18 or the 
nineteenth step 19. All three such paths are then terminated by the termination step 5. 

Figure 4 illustrates in more details the process carried out by the second step 2 shown in 
5 Figure 3 that is used to determine the average queue length. The process begins with an 

incrementing step 25, which increments the cell count. A comparison step 26 then compares 
the cell count to the cell count threshold. If they are not equal, the process continues to the 
third step 3. If otherwise they are equal then a first computation step 27 obtains a queue- 
y length averaging parameter called Alpha from the instantaneous queue size. Then a second 
=p 10 computation step 28 calculates the average queue size value as the old value of the average 
q queue size times (1- Alpha) added to the instantaneous queue size value times Alpha. A 
2: resetting step 29 resets the cell counter to zero and the process continues to the third step 3. 

]\ In alternative embodiments, the SPEPD system is optionally augmented with a priority 

[U scheme allowing differentiated service among any one of the service classes sharing a 

jjj 1 5 common buffer pool, or alternatively among service groups sharing a common service class 

buffer space. This priority scheme provides a base for efficient buffer management for future 

applications and services, such as Virtual Private Networks (VPN). 

The following describes an exemplary embodiment that supports the sharing of buffer space 
between two priority classes of traffic in accordance with this invention. The total buffer size 

20 is divided into M regions for high-priority traffic, and into TV regions for low-priority traffic. 
Each of these two types of traffic has a corresponding congestion threshold. When the 
average queue size of high-priority traffic is above the corresponding congestion threshold 
and is in the buffer region m,l<m<M, one of every F(m) high-priority packets is discarded. 
Also, when the average queue size of low-priority traffic is above the corresponding 

25 congestion threshold and is in the buffer region n, 1 < n < N 9 one of every F(n,m) low-priority 
packets is discarded. Here, F(w) is a descending staircase function similar to that given in 
Figure 1, while the function F(n,m), as illustrated in Figure 2 is a multi variable function 




dependant upon both m and n. For a fixed value ofm, the function retains a descending 
staircase behaviour with varying n values. 

In order to illustrate a hardware embodiment of SPEPD, an implementation of the single- 
priority version of SPEPD with arrival-based triggering of queue-length averaging update is 
5 described in detail herein. The total buffer size is divided into N— 16 equal regions. The 
values of the function F(n) are denoted as PkCntThr (the packet-count threshold). This 
function is implemented as a lookup table to provide a reduced computational complexity for 
this embodiment, with the index of the lookup table being the four Most Significant Bits 
(MSB) of the average queue size. In this respect, the lookup table stores 2 4 = 16 elements. 

10 Next, each region is associated with a queue-length averaging parameter, denoted by Alpha, 
Respective values of this parameter are also stored in a lookup table, with the four MSBs of 
the instantaneous queue size used as the index of this table, to reduce the implementation 
computational complexity. Alpha is typically a small positive number less than one, example 
values of which are given in Figure 7. Appropriate values of Alpha are found either 

15 empirically or through software simulations. To simplify implementation, Alpha values are 
obtained as 2~ k , where k is an integer. Finally, a partial packet discard (PPD) is initiated when 
a cell is discarded due to buffer starvation. The aforementioned Figure 7 is a table that shows 
the correlation between either the queue size or the average queue size and Alpha values with 
varying Packet-count threshold values (F(n) or F(«, m) values) with a congestion threshold of 

20 25%, a cell count threshold of 210 cells, and a switch buffer size of 8192 cells per port. 

A connection table stores all relevant information about each connection. This information 
includes two state variables associated with SPEPD, which are maintained for each eligible 
connection. The first state variable is used to store the state of discard, namely IDLE, EPD 
and PPD, while the second state variable is used to indicate whether the last cell received 
25 from the connection is an end-of-packet cell. 

In addition, four global variables are maintained for each class of traffic to which the SPEPD 
scheme applies. The first global variable is used to count the number of cells queued, 
denoted as QSize in Figures 3 and 4. The second global variable is used to store the average 
number of cells queued, denoted as AvQSize in Figure 3 and 4. The third global variable is 
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used to count the number of packets that have entered the system since the last packet 
discard, denoted as PkCnt in the Figure 3. The fourth global variable is used to count the 
number of cell arrivals since the last queue-size averaging event, denoted as CICnt in Figure 
3 and 4. 

5 Finally, two fixed parameters are maintained for each class of traffic. The first fixed 
parameter is the congestion threshold, denoted as CongThr in Figure 3. The counting of 
packets is disabled when the average queue size is below this threshold. The second fixed 
parameter is the cell counter threshold, denoted as CICntThr in Figure 4. The new average 
queue size value is calculated when the number of cell arrivals is equal to this threshold. The 
10 flowchart of a single-priority SPEPD scheme is shown in Figure 3, which is described above. 
The calculation of average queue size is depicted by the flowchart shown in Figure 4, as 
described above. 

Figure 5 illustrates in a block diagram a hardware embodiment of the SPEPD process in 
accordance with this invention. In this embodiment, incoming traffic 31 is routed to a Write 

15 process 32, which appends internal headers to each cell, and obtains a buffer address with 
which to store the cell. The Write Process 32 provides an SPEPD block 33, which is 
described in detail in Figures 3 and 4, with the connection ID 32a of an arriving cell. In turn 
the SPEPD block 33 provides the Write process 32 with decision information 33a as to 
whether the cell should be buffered or discarded. A Buffer Manager 34 receives all the cells 

20 that are to be buffered 32b from the Write process 32, and then handles the overall 

accounting of buffer space. The Buffer Manager 34 provides the queue size 34a to the 
SPEPD block 33. The SPEPD block 33 causes the packet-count threshold information and 
queue-length averaging parameters 35a to be retrieved from a Lookup Table 35, in order to 
realize a better performance and avoid computationally expensive operations. These 

25 parameters are obtained from the four MSBs of the average and instantaneous queue sizes 
33b respectively. The Buffer Manager 34 passes the buffered version of its input 34b to the 
Read Process 36. The Read Process 36 handles any processing that is required for any 
outgoing cell. The Outgoing Traffic 36a is the final product of the SPEPD hardware 
embodiment. 
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To demonstrate the performance advantage of the SPEPD scheme of this invention over the 
conventional EPD scheme, a performance evaluation through software simulations has been 
conducted and relevant results are provided herein. The obtained simulation results are 
presented for a network as shown in Figure 6. Figure 6 illustrates a high level network 
5 diagram consisting of 15 networked workstations (Workstations 1-15) and a server, at 

different geographic locations, connected, bidirectionally, to an ATM switch using industry 
standard Tl links. 

In the simulation, each workstation generates TCP/IP traffic that incorporates industry 
standard File Transfer Protocol (FTP). Each FTP session connects a workstation to the 

10 server, through the ATM switch. Each workstation is set to generate 270 TCP sessions per 
hour, stochastically distributed according to a Poisson distribution. Each TCP session is an 
upload FTP session with an average upload size of 128 KB, stochastically distributed 
according to a Normal distribution. The TCP model used is based on the industry standards 
known as Request For Comment (RFC) 793 and RFC 1 122. The propagation delay in each 

15 Tl link connecting either a workstation or the server with the ATM switch is 125 ms, which 
is typical of ground-to-satellite delays for Geo-stationary (GEO) satellites. Finally, the 
average load on the link from the ATM switch to the server is approximately 76 %. 

Two different scenarios are considered. In the first scenario, the ATM switch employs the 
conventional EPD method and the congestion threshold for EPD is set to 75 % of the switch 
20 buffer size, which is 8192 cells per port. In the second scenario, the ATM switch employs the 
SPEPD method of this invention and the SPEPD parameters used in the simulation are shown 
in Figure 7, which is described earlier. 

Figures 8 and 9 depict the cell loss ratio for TCP traffic transported using Unspecified Bit 
Rate (UBR) service, and measured at the ATM switch for the two respective cases using the 
25 EPD and SPEPD methods. Figures 10 and 1 1 depict the average TCP delay measured at the 
server for the same two respective cases using the EPD and SPEPD methods. These four 
figures confirm the superior performance of the SPEPD method with respect to the 
conventional EPD techniques. By performing an earlier packet discard than in the EPD 
method, and by spacing packet discard over multiple packets, the SPEPD method avoids the 
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problem of global TCP synchronization, which results in shorter congestion periods, a 
reduced cell loss ratio by almost a factor of 2, and a lower TCP delay; The maximum TCP 
delay observed during simulation is close to 80 seconds for the case of EPD and close to 50 
seconds for the case of SPEPD. 

Due to its simple implementation and performance results that are superior to conventional 
techniques, it is foreseeable that the SPEPD method will find application in numerous 
telecommunication fields including Traffic Management, both terrestrial- and satellite-based 
ATM switches, and ATM Terminals and Gateways. 

Of course, numerous variations and adaptations may be made to the particular embodiments 
of the invention described above, without departing from the spirit and scope of the 
invention, which is defined in the claims. As an example, the SPEPD embodiments as 
described above use an arrival-based scheme, wherein the calculations of the conditions of 
the queues and the like are done upon the arrival of a new packet. Nonetheless, in an 
alternative embodiment using a departure-based scheme, the calculations are done at the 
departure of a packet, without departing from the main spirit and scope of the invention. 
Though performance may suffer slightly with this approach, a departure-based SPEPD 
method is feasible, and should be considered as an alternative embodiment of this invention. 




